A Proactive Intelligent Decision Support System for Predicting the Popularity of Online News

نویسندگان

  • Kelwin Fernandes
  • Pedro Vinagre
  • Paulo Cortez
چکیده

Due to the Web expansion, the prediction of online news popularity is becoming a trendy research topic. In this paper, we propose a novel and proactive Intelligent Decision Support System (IDSS) that analyzes articles prior to their publication. Using a broad set of extracted features (e.g., keywords, digital media content, earlier popularity of news referenced in the article) the IDSS first predicts if an article will become popular. Then, it optimizes a subset of the articles features that can more easily be changed by authors, searching for an enhancement of the predicted popularity probability. Using a large and recently collected dataset, with 39,000 articles from the Mashable website, we performed a robust rolling windows evaluation of five state of the art models. The best result was provided by a Random Forest with a discrimination power of 73%. Moreover, several stochastic hill climbing local searches were explored. When optimizing 1000 articles, the best optimization method obtained a mean gain improvement of 15 percentage points in terms of the estimated popularity probability. These results attest the proposed IDSS as a valuable tool for online news authors.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Designing an intelligent system for predicting chromosomal genetic diseases using data mining

Background and Aim: Today we are witnessing tremendous advances in medical data mining. The data, by analyzing and discovering the relationships between them, can lead to algorithms that help us prevent or treat many diseases. Meanwhile, genetic diseases have attracted a large part of the attention of the medical world because the birth of children with genetic disorders imposes a great financi...

متن کامل

Intelligent and Online Evaluation of Diabetes using Wireless Sensor Networks and Support Vector Machines Algorithm

Objective: International Diabetes Organization estimates that there are 285 million people worldwide who suffer from diabetes, and this figure is expected to increase to 450 million in next 20 years. According to statistics issued by the World Health Organization, diabetes is considered among ten leading causes of death in world and its prevalence in the population is growing.This paper deals w...

متن کامل

Predicting and Evaluating the Popularity of Online News

Reading and sharing online news has become an important part of people’s entertainment lives. Therefore it would be greatly helpful if we could accurately predict the popularity of news prior to its publication for social media workers (authors, advertisers, etc.). Our goal is to predict the popularity of a news post (measured by number of shares) based on various features (see Table I.). In th...

متن کامل

AN INTELLIGENT INFORMATION SYSTEM FOR FUZZY ADDITIVE MODELLING (HYDROLOGICAL RISK APPLICATION)

In this paper we propose and construct Fuzzy Algebraic Additive Model, for the estimation of risk in various fields of human activities or nature’s behavior. Though the proposed model is useful in a wide range of scientific fields, it was designed for to torrential risk evaluation in the area of river Evros. Clearly the model’s performance improves when the number of parameters and the actual d...

متن کامل

Design and implementation of an intelligent clinical decision support system for diagnosis and prediction of chronic kidney disease

Introduction: Chronic kidney disease (CKD) is one of the most important public health concerns worldwide. The steady increase in the number of people with End-stage renal disease (ESRD) needing a kidney transplant to survive and incur high costs, highlights early diagnosis and treatment of the disease. This study aimed to design a Clinical Decision Support System (CDSS) for diagnosing CKD and p...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2015